Delving into Sample Loss Curve to Embrace Noisy and Imbalanced Data
نویسندگان
چکیده
Corrupted labels and class imbalance are commonly encountered in practically collected training data, which easily leads to over-fitting of deep neural networks (DNNs). Existing approaches alleviate these issues by adopting a sample re-weighting strategy, is re-weight designing weighting function. However, it only applicable for data containing either one type biases. In practice, however, biased samples with corrupted tailed classes co-exist data. How handle them simultaneously key but under-explored problem. this paper, we find that two types samples, though have similar transient loss, distinguishable trend characteristics loss curves, could provide valuable priors weight assignment. Motivated this, delve into the curves propose novel probe-and-allocate strategy: probing stage, train network on whole without intervention, record curve each as an additional attribute; allocating feed resulting attribute newly designed curve-perception network, named CurveNet, learn identify bias assign proper weights through meta-learning adaptively. The speed meta learning also blocks its application. To solve it, method skip layer optimization (SLMO) accelerate skipping bottom layers. Extensive synthetic real experiments well validate proposed method, achieves state-of-the-art performance multiple challenging benchmarks.
منابع مشابه
Revised Loss Bounds for the Set Covering Machine and Sample-Compression Loss Bounds for Imbalanced Data
Marchand and Shawe-Taylor (2002) have proposed a loss bound for the set covering machine that has the property to depend on the observed fraction of positive examples and on what the classifier achieves on the positive training examples. We show that this loss bound is incorrect. We then propose a loss bound, valid for any sample-compression learning algorithm (including the set covering machin...
متن کاملDiscrete Polynomial Curve Fitting to Noisy Data
A discrete polynomial curve is defined as a set of points lying between two polynomial curves. This paper deals with the problem of fitting a discrete polynomial curve to given integer points in the presence of outliers. We formulate the problem as a discrete optimization problem in which the number of points included in the discrete polynomial curve, i.e., the number of inliers, is maximized. ...
متن کاملDelving into Transition to the Semantic Web
The semantic technologies pose new challenge for the way in which we built and operate systems. They are tools used to represent significances, associations, theories, separated from data and code. Their goal is to create, to discover, to represent, to organize, to process, to manage, to ratiocinate, to represent, to share and use the significances and knowledge to fulfill the business, persona...
متن کاملFeature Curve Co-Completion in Noisy Data
Feature curves on 3D shapes provide important hints about significant parts of the geometry and reveal their underlying structure. However, when we process real world data, automatically detected feature curves are affected by measurement uncertainty, missing data, and sampling resolution, leading to noisy, fragmented, and incomplete feature curve networks. These artifacts make further processi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2022
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v36i6.20661